Improved Music Genre Classification with Convolutional Neural Networks
نویسندگان
چکیده
In recent years, deep neural networks have been shown to be effective in many classification tasks, including music genre classification. In this paper, we proposed two ways to improve music genre classification with convolutional neural networks: 1) combining maxand averagepooling to provide more statistical information to higher level neural networks; 2) using shortcut connections to skip one or more layers, a method inspired by residual learning method. The input of the CNN is simply the short time Fourier transforms of the audio signal. The output of the CNN is fed into another deep neural network to do classification. By comparing two different network topologies, our preliminary experimental results on the GTZAN data set show that the above two methods can effectively improve the classification accuracy, especially the second one.
منابع مشابه
Deep Image Features in Music Information Retrieval
Applications of Convolutional Neural Networks (CNNs) to various problems have been the subject of a number of recent studies ranging from image classification and object detection to scene parsing, segmentation 3D volumetric images and action recognition in videos. CNNs are able to learn input data representation, instead of using fixed engineered features. In this study, the image model traine...
متن کاملLocal-feature-map Integration Using Convolutional Neural Networks for Music Genre Classification
A map-based approach, which treats 2-dimensional acoustic features using image analysis, has recently attracted attention in music genre classification. While this is successful at extracting local music-patterns compared with other frame-based methods, in most works the extracted features are not sufficient for music genre classification. In this paper, we focus on appropriate feature extracti...
متن کاملParallel Convolutional Neural Networks for Music Genre and Mood Classification
Our approach to the MIREX 2016 Train/Test Classification Tasks for Genre, Mood and Composer detection is based on an approach combining Melspectrogram transformed audio and Convolutional Neural Networks (CNN). We utilize two different CNN architectures, a sequential one, and a parallel one, the latter aiming at capturing both temporal and timbral information in two different pipelines, which ar...
متن کاملMusic Genre Classification with Paralleling Recurrent Convolutional Neural Network
Deep learning has been demonstrated its effectiveness and efficiency in music genre classification. However, the existing achievements still have several shortcomings which impair the performance of this classification task. In this paper, we propose a hybrid architecture which consists of the paralleling CNN and Bi-RNN blocks. They focus on spatial features and temporal frame orders extraction...
متن کاملMusic Genre Classification Using Convolutional Neural Network2014.10.21.docx
Feature extraction is a crucial part of many MIR tasks. Many manual-selected features such as MFCC have been applied to music processing but they are not effective for music genre classification. In this work, we present an algorithm based on spectrogram and convolutional neural network (CNN). Compared with MFCC, the spectrogram contains more details of music components such as pitch, flux, etc...
متن کامل